From Data Fusion to Knowledge Fusion
Abstract
The task of data fusion is to identify the true values of data items (e.g., the true date of birth for Tom Cruise) among multiple observed values drawn from different sources (e.g., Web sites) of varying (and unknown) reliability. A recent survey [20] has provided a detailed comparison of various fusion methods on Deep Web data. In this paper, we study the applicability and limitations of different fusion techniques on a more challenging problem: knowledge fusion. Knowledge fusion identifies true subject-predicate-object triples extracted by multiple information extractors from multiple information sources. These extractors perform the tasks of entity linkage and schema alignment, thus introducing an additional source of noise that is quite different from that traditionally considered in the data fusion literature, which focuses only on factual errors in the original sources. We adapt state-of-the-art data fusion techniques and apply them to a knowledge base with 1.6B unique knowledge triples extracted by 12 extractors from over 1B Web pages, which is three orders of magnitude larger than the data sets used in previous data fusion papers. We show that data fusion approaches hold great promise for solving the knowledge fusion problem, and we suggest interesting research directions through a detailed error analysis of the methods.
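As a rough illustration of the fusion task described in the abstract, the sketch below resolves conflicting values for a data item by simple majority voting over (subject, predicate, object) triples. Voting is only a naive baseline chosen here for illustration; it is not claimed to be one of the methods the paper adapts. The function name `fuse_by_vote`, the provenance strings, and the sample observations are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical observations: (subject, predicate, object, provenance).
# Provenance stands for a (source, extractor) pair; all names are made up.
observations = [
    ("Tom Cruise", "date_of_birth", "1962-07-03", "site_a/extractor_1"),
    ("Tom Cruise", "date_of_birth", "1962-07-03", "site_b/extractor_1"),
    ("Tom Cruise", "date_of_birth", "1963-07-03", "site_c/extractor_2"),
]


def fuse_by_vote(observations):
    """For each (subject, predicate) data item, return the most-voted object
    and the fraction of distinct provenances that support it."""
    votes = defaultdict(Counter)
    seen = set()
    for subj, pred, obj, prov in observations:
        key = (subj, pred, obj, prov)
        if key in seen:  # count each provenance once per triple
            continue
        seen.add(key)
        votes[(subj, pred)][obj] += 1

    fused = {}
    for item, counter in votes.items():
        total = sum(counter.values())
        obj, count = counter.most_common(1)[0]
        fused[item] = (obj, count / total)
    return fused


result = fuse_by_vote(observations)
for (subj, pred), (obj, support) in result.items():
    print(subj, pred, obj, round(support, 2))
```

Plain voting treats every provenance as equally reliable. The abstract's point is that sources and extractors differ in reliability and that extractors add their own noise, which is why a baseline like this one is only a starting point.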
Journal: PVLDB
Volume: 7
Publication date: 2014